Full-Network Embedding in a Multimodal Embedding Pipeline

Authors

Armand Vilalta

Dario Garcia-Gasulla

Ferran Parés

Eduard Ayguadé

Jesús Labarta

Ulises Cortés

Toyotaro Suzumura

Abstract

The current state of the art for image annotation and image retrieval tasks is obtained through deep neural networks, which combine an image representation and a text representation into a shared embedding space. In this paper we evaluate the impact of using the Full-Network embedding in this setting, replacing the original image representation in a competitive multimodal embedding generation scheme. Unlike the one-layer image embeddings typically used by most approaches, the Full-Network embedding provides a multi-scale representation of images, which results in richer characterizations. To measure the influence of the Full-Network embedding, we evaluate its performance on three different datasets, and compare the results with the original multimodal embedding generation scheme when using a one-layer image embedding, and with the rest of the state of the art. Results for image annotation and image retrieval tasks indicate that the Full-Network embedding is consistently superior to the one-layer embedding. These results motivate the integration of the Full-Network embedding into any multimodal embedding generation scheme, something feasible thanks to the flexibility of the approach.


Similar resources

Steganalysis of embedding in difference of image pixel pairs by neural network

In this paper a steganalysis method is proposed for the pixel value differencing method. This steganographic method, which has been immune against conventional attacks, performs the embedding in the difference of the values of pixel pairs. Therefore, the histogram of the differences of an embedded image is different as compared with a cover image. A number of characteristics are identified in the di...


Link Prediction using Network Embedding based on Global Similarity

Background: Link prediction is one of the most widely studied problems in complex network analysis. Link prediction requires knowing the background of previous link connections and combining them with available information. The local link prediction approaches with node structure objectives are fast but not accurate enough. On the other hand, the global link predicti...


Unifying Visual-Semantic Embeddings with Multimodal Neural Language Models

Inspired by recent advances in multimodal learning and machine translation, we introduce an encoder-decoder pipeline that learns (a): a multimodal joint embedding space with images and text and (b): a novel language model for decoding distributed representations from our space. Our pipeline effectively unifies joint image-text embedding models with multimodal neural language models. We introduc...


Detecting Overlapping Communities in Social Networks using Deep Learning

In network analysis, a community is typically thought of as a group of nodes with a high density of edges among themselves and a low density of edges relative to other parts of the network. Detecting a community structure is important in any network analysis task, especially for revealing patterns between specified nodes. There is a variety of approaches presented in the literature for overlapping...

Embedding measure spaces

For a given measure space $(X,\mathscr{B},\mu)$ we construct all measure spaces $(Y,\mathscr{C},\lambda)$ in which $(X,\mathscr{B},\mu)$ is embeddable. The construction is modeled on the ultrafilter construction of the Stone-Čech compactification of a completely regular topological space. Under certain conditions the construction simplifies. Examples are given when this simplification o...


Journal:

CoRR

Volume: abs/1707.09872

Publication date: 2017
